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Abstract 

Suppose G is a graph with degrees bounded by d, and one needs to remove more than 
en of its edges in order to make it planar. We show that in this case the statistics of local 
neighborhoods around vertices of G is far from the statistics of local neighborhoods around 
vertices of any planar graph G' with the same degree bound. In fact, a similar result is proved 
for any minor-closed property of bounded degree graphs. 

As an immediate corollary of the above result we infer that many well studied graph prop- 
erties, like being planar, outer-planar, series-parallel, bounded genus, bounded tree-width and 
several others, are testable with a constant number of queries, where the constant may depend 
on e and d, but not on the graph size. None of these properties was previously known to be 
testable even with o{n) queries. 

1 Introduction 

Suppose we are given an n-vertex graph of bounded degree and are asked to decide if it is planar. 
This problem is well known to be solvable in time Q(n) [25j. But suppose we are only asked to 
distinguish with probability 2/3 between the case that the input is planar from the case that an e- 
fraction of its edges should be removed in order to make it planar. Can we design a faster algorithm 
for this relaxed version of the problem, with running time o(n)? A special case of our main result 
is that in this case we can even design an algorithm whose running time is a constant that depends 
only on e (and the bound on the degrees) and is independent of the size of the input. 

1.1 Background on property testing 

Before stating our main result, which is a significant generalization of the above mentioned result 
on planarity testing, let us briefly introduce the basic notions in the held of property testing. The 
meta problem is the following: given a combinatorial structure S, distinguish if S satisfies some 
property or if 5" is e-far from satisfying V, where S is said to be e-far from satisfying V if an 
e-fraction of its representation should be modified in order to make S satisfy V. The main goal is 
to design randomized algorithms, which look at a very small portion of the input, and using this 
information distinguish with high probability between the above two cases. Such algorithms are 
called property testers or simply testers for the property V. Preferably, a tester should look at a 
portion of the input whose size is a function of e only. Blum, Luby and Rubinfeld [6j were the first 
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to formulate a question of this type, and the general notion of property testing was first formulated 
by Rubinfeld and Sudan [37j . 

The main focus of this paper is testing properties of graphs in the bounded degree model, which 
was introduced by Goldreich and Ron [22]. In this model, we fix a degree bound d and represent 
graphs using adjacency lists. More precisely, we assume that a graph G is represented as a function 
fc '■ [n] X [d] [n] U {*}, where given a vertex v £ V{G) and 1 < i < d the function f{v, i) returns 
the i^^ neighbor of v, in case v has at least i vertices. If v has less than i vertices then f{v,i) = *. 
A graph of bounded degree d is said to be e-far from satisfying V if one needs to execute at least 
edn edge operations of deleting or adding an edge to G in order to turn it into a graph satisfying 
V. (Since d is treated as a constant in this paper, the edn above is just proportional to en.) 

A testing algorithm (or tester) for graph propertj{l| V and accuracy e is a (possibly randomized) 
algorithm that distinguishes with probability at least 2/3 between graphs satisfying V from graph 
that are e-far from satisfying it. More precisely, if the input graph satisfies V the algorithm accepts 
it with probability at least 2/3, where the probability is taken over the coin tosses of the tester. 
Similarly, if it is e-far from satisfyirig V, the algorithm should reject it with probability at least 
2/3. The tester is given n as inpulo and is provided with access to the function /q as a black 
box. We define the query complexity qT{n) of the tester T as the maximal number of /c-calls that 
the tester can execute on any graph G with n vertices. Finally, we define the following notion of 
efficient testing: 

Definition (Testable). A graph property V is testable if for every e > there is some constant 
c(e) < cxD and a tester T that tests V with accuracy e and satisfies sup^qTin) < c(e) for every n. 
In other words, if for any e > 0, there is a constant time randomized algorithm that can distinguish 
with high probability between graphs satisfying V from those that are e-far from satisfying it. 

1.2 Minor-closed properties and the main result 

Our main result deals with the testing of minor closed graph properties. Let us briefly introduce 
the basic notions in this area, which is too rich to thoroughly survey here. For more details, see 
Chapter 12 of [H] and the recent article of Lovasz [31] on the subject. A graph H is said to a minor 
of a graph G, if H can be obtained from G using a sequence of vertex removals, edge removals and 
edge contraction^. If H is not a minor of G then G is said to be H -minor free. The topic of graph 
minors is among the most (or perhaps the single most) studied concepts in graph theory. Our main 
result in this paper is accordingly the following: 

Theorem 1.1 (Main Result). For every (finite) graph H, the property of being H -minor free is 
testable. More generally, every minor-closed graph property is testable. 

Recall that a graph property V is minor-closed if every minor of a graph in V is also in V, or 
equivalently if V is closed under removal of edges, removal of vertices and contraction of vertices. 

Perhaps the most well-known result in the area of graph minors is the Kuratowski- Wagner 
Theorem \28\ 142] . which states that a graph is planar if and only if it is iTs-minor free and K^^^- 
minor free. This fundamental result raised the natural question if a similar characterization, using a 

^As usual, a graph property is simply a family of graphs closed under graph isomorphism. 
■^Our tester needs n as the input only to be able to pick a vertex at random from the tested graph. 
^Contracting an edge connecting vertices u, v is the result of replacing u and u by a new vertex w, and connecting 
w to all the vertices that were connected to either u or v. Loops and multiple edges are removed. 
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finite family of forbidden minors, also holds for embedding graphs in other fixed surfaces? Observe 
that a graph remains planar if we remove one of its edges or vertices and if we contract one of 
its edge. In fact, this closure property under these three basic operations holds for the property 
of a graph being embeddable in any specific surface. Thus the property of being embeddable in 
a specific surface is minor-closed. In one of the deepest results in graph theory, Robertson and 
Seymour proved the so called Graph- Minor Theorem }34], which states that for every minor-closed 
graph property V, there is a finite family of graphs Ti-p such that a graph satisfies V if and only if it 
is H-vuvnoT free for all H S Ti-p. Note that this in particular answers the above mentioned problem 
regarding the characterization of graphs embeddable in a fixed surface using a finite number of 
forbidden minors. 

We note that besides the above (mainly) graph theoretic motivation, the graph minors and minor 
closed properties have also received a considerable amount of attention by the Computer-Science 
community, because many natural graph properties (e.g., planarity) are described by excluded 
minors, and graph problems that are NP-hard in general can often be solved on excluded minor 
families in polynomial time, or at least be approximated better than on arbitrary graphs. See [11] 
and its references. 

Let us mention some well studied minor-closed graph properties. Of course, the most well 
known such property is Planarity. A well-studied variant of planarity is Outer-planarity, the 
property of being embeddable in the plane in such a way that all vertices lie on the outer face. 
Another generalization of planarity is being embeddable in a surface of genus at most k. Graphs 
satisfying this property are said to have genus k. The Tree-width of a graph is one of the most 
important invariants of graphs. This notion, which measures how close a graph is to being a tree, 
was introduced by Robertson and Seymour as part of their proof of the Graph-Minor Theorem; see 
[7]. It also has numerous applications in the area of fixed-parameter algorithms; see |17] for more 
details. As it turns out, having bounded tree-width is also a minor-closed graph property. Another 
well-known minor-closed property is being Series-parallel. Series parallel graphs are graphs that 
can be obtained from a single edge by sequence of parallel extensions (adding an edge parallel to an 
edge that already exists) and series extensions (subdividing an edge by a new node). We conclude 
with the property of being Knotlessly-embeddable, which is the property of being embeddable 
in in a way that no two cycles are linked and no cycle is knotted. 

By Theorem 11.11 we infer that the above mentioned properties are all testable with a constant 
number of queries. It is interesting to note that prior to this work none of the above properties was 
even known to be testable with qr{n) = o{n) queried. 

One important aspect of Theorem 1 1.1^ which we have neglected to address thus far, is the actual 
dependence of the query-complexity of the testers on e. As it does not seem like one can achieve 
a query complexity that is sub-exponential in 1/e using our approach, we opted in several places 
not to give explicit bounds. We further discuss this issue in Section [5l where we show how can one 
derive explicit (but rather large) upper bounds on the query complexity of the testers as a function 
of e. 

^The conference version of |22j announced a tester for planarity, which eventually turned out to be wrong. 
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1.3 Hyper-finiteness and testing monotone hyper-finite properties 

The following notion of hyper-finiteness was defined by Elek [18j (though it is implicit in [30j, for 
example) . 

Definition (Hyper-Finite). A graph G = (y,E) is (5, k) -hyper- finite if one can remove S\V\ edges 
from G and obtain a graph with connected components of size at most k. A collection of graphs 
Q is hyper-finite if for every 5 > there is some finite k = k{6) such that every graph in Q is 
{5, k) -hyper- finite. 

Our main theorem, Theorem II .H will be a corollary of the following more general result. 
Theorem 1.2. Every monotone hyper-finite graph property is testable. 

Recall that a graph property V is monotone if every subgraph of a graph in V is also in V. 

For example, consider the property that the number of vertices at distance r around every vertex 
is at most some fixed function g{r) satisfying g{r) = 2°'-''^ as r — > oo. Theorem 11.21 can be used to 
show that this property is testable (but this does not seem to follow directly from Theorem II. ip . 
Elek [19j studied property testing within this class of graphs. 

1.4 Comparison to previous results 

Our main result in this paper deals with testing properties of bounded degree graphs. Let us 
briefly mention some results on testing properties of dense graphs, a model that was introduced by 
Goldreich, Goldwasser and Ron [21]. In this model, a graph G is said to be e-far from satisfying a 
property V, if one needs to add/delete at least en? edges to/from G in order to turn it into a graph 
satisfying V. The tester can ask an oracle whether a pair of vertices, say i and j, are adjacent in the 
input graph G. It was shown in [21] that every partition problem like /c-colorability and Max-Cut 
is testable in this model. Alon and Shapira [3] have shown that every hereditary graph propertjH is 
testable in dense graphs. This also gave an (essential) characterization of the graph properties that 
are testable with one-sided erroilfl. A characterization of the properties that are testable in dense 
graphs was obtained by Alon et al. [IJ. Note that in this model (as its name suggests) we implicitly 
assume that the input graph is dense, because the definition of e-far is relative to n^. Therefore, 
some properties are trivially testable in this model. In particular, minor-closed properties are 
trivially testable in this model with 0(l/e) queries and even with one-sided error. This follows 
from the result of Kostochka and independently Thomason [261 [271 BOl SI] that every finite graph 
with average degree at least Q.{r^/\ogr) contains every graph on r vertices as a minor. Therefore, 
every large enough finite graph with Q.{n'^) edges does not satisfy a minor-closed property. 

As the above mentioned results indicate, testing properties in dense graphs is relatively well 
understood. In sharp contrast, there were no general results on testing properties in the bounded- 
degree model. For example, while every hereditary property is testable in dense graphs, in bounded 
degree graphs some properties are testable (e.g. being triangle-free [22j), some require 0(\/n) queries 
(e.g. being bipartite [22l|23]) and some require 0(n) queries (e.g. 3-colorability [8]). Besides the 
above mentioned results, it was also shown in [22] that A;-connectivity is testable in bounded degree 

''A graph property is hereditary if it is closed under removal of vertices. Therefore, any minor closed property is 
hereditary 

®A tester has one-sided error if it always accepts graphs satisfying the property. 
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graphs, and Czumaj and Sohler [llj have recently shown that a relaxed version of expansion is 
testable with Q{^/n) queries. Finally, Czumaj, Sohler and Shapira [10] have recently shown that 
every hereditary property is testable if the input graph is guaranteed to be non-expanding. Some 
of the arguments in the present paper are motivated by some of the ideas from [lOj . 

One reason for the fact that we understand testing of dense graphs more than we understand 
testing of bounded-degree graphs is that there are structural results "describing" dense graphs, 
primarily Szemeredi's regularity lemma [39], while there are no similar results for arbitrary sparse 
graphs. As is evident from the above discussion, our main result here is the first to show that a 
general (and natural) family of properties are all testable in bounded-degree graphs. 

For more details, see the surveys [H [12l [201 (33 [36] and Section [5] where we discuss another 
model of testing graph properties. 

1.5 Techniques and overview of the paper 

In the next section, we introduce a metric condition for testability. Basically, we define a sequence 
of pseudometrics pr indexed by an integer r G N+, where Pr{G, G') measures the difference between 
G and G' in the frequency of isomorphism types of r-neighborhoods of vertices. We show that if for 
some fixed r > the pr distance between two graph families is positive, then these graph families 
can be distinguished by a tester. 

Next, we state a theorem (Theorem 12. 2p that roughly says that (e, A;)-hyper-finite graphs are 
far away in some pseudometric pR from graphs that are not (e', A;)-hyper- finite, where e' depends 
on e and tends to zero as e \ 0. This result can be deduced from a recent result of Schramm [38] . 
concerning properties of convergent sequences of bounded degree graphs. However, we provide an 
alternative self-contained proof in Section [H In contrast with [38j , our proof is finitary and gives 
an explicit upper bound on R, while the proof in [38] did not supply such a bound. We note that 
convergent sequences of graphs have been previously used in the study of property testing of dense 
graphs [9]. 

How does a tester for a monotone hyperfinite graph property V work? A first guess might 
be the following: as the property is monotone, we may expect to sample enough vertices such 
that with high probability the neighborhood of at least one of them will not satisfy the property, 
thus establishing that the graph itself does not satisfy the property. This works for properties like 
triangle-freeness, but it is not difficult to see, however (see item 2, in the concluding remarks) that 
this approach of aiming for a one-sided error test is bound to fail in our case. The reason is that a 
graph can be far from being (say) planar, yet locally be planar. But still, the local neighborhoods 
of vertices do tell us something about its global properties. For example, in [22], it was shown that 
the local density around vertices can be used to test the property of being cycle-free. By the above 
discussion we must look for another property that the graph must satisfy if it satisfies a minor 
closed property. As it turns out, the right property to look for is hyper-finiteness. 

We now proceed with an informal and not very precise description of how and why our tester 
works. Given an input graph G, the tester first tests for hyper-finiteness. The test for hyper- 
finiteness proceeds by picking a bounded number of vertices at random and exploring the R- 
neighborhood of each one (where the number of vertices chosen and R depend on e). It then checks 
if the observed frequency of the isomorphism types of these neighborhoods looks approximately 
like some (r/, /c)-hyper-finite graph, with appropriate values for r] and k. We do not have an explicit 
characterization of the approximate frequencies occurring in appropriately hyper-finite graphs, but 
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since only an approximation is needed (with known accuracy), there is a finite table listing the 
occurring frequencies, which can be part of the algorithm. The point here is that the table does 
not depend on the size of the tested graph. If the graph fails the hyper-finiteness test, then it is 
rejected. If it passes, then we know that by removing a small proportion of the edges it is broken 
down to pieces of size k. As these small pieces are still far from satisfying the property, we can 
use a bounded number of random samples to actually find a subgraph of the input that does not 
satisfy the property. 

In Section [3] we prove all of the stated result, with the exception of Theorem 12.21 which is 
proved in Section HI In particular, by combining a theorem of Lipton and Tarjan with a result of 
Alon, Seymour and Thomas [2], regarding separators in minor- free graphs, we get that //-minor 
free graphs are hyper-finite. This facilitates a proof of Theorem 11.11 from Theorem 11.21 In Section 
Owe discuss several open problems and conjectures for future research. 

2 A Metric Criterion for Testability 

Let us slightly generalize the notion of property testing, and say that two graph properties A and 
B are distinguishable if there is a randomized algorithm that makes a bounded number of queries 
to an input graph and accepts every graph in A with probability at least 2/3 and rejects every 
graph in B with probability at least 2/3. Then e-testing A is equivalent to taking B to be the set 
of graphs that are e-far from A. 

Shortly, we will define a useful pseudo-metricj^ on the set of graphs of bounded degree d. But 
first, we need to set some terminology. A rooted graph is a pair {G,v), where G is a graph and 

V G V{G). An isomorphism between rooted graphs {H,u) and {G,v) is an isomorphism between 
the underlying graphs H and G that maps u to v. For a vertex v S V{G), we denote by BG{v,r) 
the induced subgraph of G whose vertices consist of the vertices of G at distance at most r from 
V. Consider a rooted graph H and a finite graph G. Let m^{H) denote the number of vertices 

V G V{G) such that there is a rooted-graph isomorphism from (^BG{v,r),v) onto H. (Of course, 
this is often zero.) Set ^J-fiH) := m^{H)/\V{G)\, and define a pseudometric pr by 

Pr{G,G') := j;|/i?(//)-/xf (i/)|, 

H 

where the sum extends over all isomorphism types of rooted graphs. Clearly, the number of terms 
that are nonzero is bounded by a constant that depends only on r and a bound on the degrees in G 
and G'. Observe that fif defines a probability measure on the set of rooted graphs and that pr is 
monotone non-decreasing in r. If A and B are graph families, we define Pr{A, B) := mi{pr{G, G') : 
G G A,G' £ B}. The following proposition gives a metric condition for distinguishability. 

Proposition 2.1. Let A and B be two graph properties having only graphs with degrees at most d. 
If there is some integer R > such that pr{A, B) > 0, then A and B are distinguishable. 

This gives a sufficient condition for A to be testable: if for every e > the set B[e) of graphs 
that are e-far from A satisfies sup^ /3h(^, -B(e)) > 0, then A is testable. 

pseudometric on a set X differs from a metric on X in that a pseudometric is allowed to be zero on pairs [x, y) 
with X ^ y. 
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The converse to Proposition 12.11 does not hold. For example, if A is the collection of graphs 
with an even number of vertices and B is the collection with an odd number of vertices, then A and 
B are distinguishable in the current model in which the number of vertices of the graph is given as 
the input. However, it is not hard to come up with a natural model for property testing in which 
the criterion given by the proposition is necessary and sufficient. 

The primary reason that makes hyper-finiteness so useful for property testing is the following 
theorem. 

Theorem 2.2. Fix d,k £ N-(- and e > 0. Let A be the set of {e,k) -hyper- finite graphs with vertex 
degrees hounded by d, and let B be the set of finite graphs with vertex degrees hounded by d that are 
not (4e log(4(i/e), fc) -hyper-finite. Then there is some R = R{d, k, e) G N+ such that pr{A, B) > 0. 

Theorem 12.21 can actually be deduced from a recent result of Schramm ^8], which deals with 
infinite unimodular graphs. However, in the Section [4] we present a proof that is adapted to the 
finite setting, gives quantitative bounds on R and pr{A., B), and would hopefully be more accessible. 

Let us now give an overview of the proof of Theorem l2.2[ Assuming that G is (e, fc)-hyper-finite 
and that G' is a graph satisfying pji{G,G') < 6 with some appropriate 6 > 0, we will show that 
G' must be (4e log(4 d/e), A;)-hyper-finite. First, since G is (e, A;)-hyper-finite, there is a subset S 
of the edges of G, of size at most e|l^(G)|, that partitions G into connected components of size at 
most k. We then use the existence of S to construct a random S C E{G), where each connected 
component of G\S has at most k vertices and the expected size of S is at most 4 e log(3 d/e)\V{G)\. 
The most important feature of the way we will pick S is that it is local, in the sense that there 
is a finite bound R = R{d, e, fc), such that the probability that an edge e is in 5 only depends on 
the isomorphism type of the pair (i?G(e, i?), e) . Now, if G' is a graph satisfying pji{G,G') < 6, 
then locally it behaves almost exactly like G does. Therefore, choosing a set of edges S' from G' 
using the same process that was used for G, and removing it from G' should also partition G' into 
connected components of size at most k. Furthermore, the expected relative size of S' in G' is close 
to that of S in G, implying that G' is (4e log(4(i/e), fc)-hyper-finite. 

We also have the following corollary which was mentioned in the abstract. 

Corollary 2.3. For every e > and d G N+ there is an R = R{€, d) £ N-|- such that if G and G' 
are finite graphs with vertex degrees bounded by d, G is planar and G' is e-far from being planar, 
then pr{G,G') > l/R. 

The proof of this corollary is an easy application of our arguments below, and will be left to 
the reader. 

3 Most proofs 

Here, we prove Proposition 12.11 and Theorems 1 1 . 2 1 and 1 1 . 1 1 The proof of Theorem 12.21 is postponed 
to Sectional 

Proof of Proposition 12.11 Suppose that for some integer i? > and some positive 5 > 
we have pr{A,B) > 6. In that case, we can distinguish between A and B using the following 
algorithm. Let TC denote the set of (isomorphism types of) rooted graphs {H, v) of radius R around 
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V and maximum degree at most d, and set h := \7i\. Then, given an input graph G of size n, we 
estimate //^(i?) for every G "H, up to an additive error of 5/{2h), with success probabiUty 1 — 
Here, we can apply an additive Chernoff bound to deduce that to this end it is enough to sample 
0{jT log/i) vertices, explore their i2-neighborhood, and compute the fraction of these vertices whose 
neighborhood is isomorphic to {H,v). Let fi^{H) be the estimated values of ij!^{H), and recall 
that when computing_p/j(G, G') we only need to consider rooted graphs {H, v) of radius R. The 
algorithm now checkqfl if there exists a graph Ga S -A. for which ~ — 

If that is the case, the algorithm declares that G belongs to A, and otherwise it declares that G 
belongs to B. Note that since pr{A, B) > 5 and \ jl'^{H) — ^'^{H)\ < 6/2 with high probability, 
the algorithm is unlikely to misclassify graphs in ^ or in i?. □ 

In order to deduce Theorem 11.21 we will also need the following lemma. 

Lemma 3.1. Let V be a monotone hyper-finite graph property with degrees bounded by d, let e > 0, 
and let B be the set of finite graphs with degrees bounded by d that are e- far from being in V. Then 
there is a finite set of graphs Z C V and an R ^ N-(- such that pfi{B,V \ Z) > 0. 

Proof. Fix some G' e B and G eV. Let eo G (0, e/2) be sufficiently smah so that 4eo log(4d/eo) < 
e/2, and let k be such that each graph in V is (eg, A;)-hyper-finite. By Theorem 12.21 we have a lower 
bound on pr{G,G') for some fixed R = R{V,e) if G' is not (e/2, /c)-hyper-finite. So assume that 
G' is (e/2, /c)-hyper-finite. 

Let IC denote the set of all (isomorphism types of) graphs K such that each connected component 
of K has at most k vertices, and a disjoint union of finitely many copies of K is not in V. Let 
IC' denote the minimal graphs in IC; that is, the set of graphs in IC that do not properly contain 
another graph of /C as a subgraph. Then the connected components of any K £ IC' are distinct, 
and hence IC' is finite. 

Assume, with no loss of generality, that the sizes of the graphs in V are unbounded. It then 
easily follows that the graphs in IC' do not have any connected component with just one vertex. 

Let Ti. denote the (finite) collection of all rooted connected graphs with degrees bounded by d 
such that each vertex is within distance at most k from the root vertex. Given a graph Y, let 'H{Y) 
denote the rooted graphs in Ti. that contain y as a subgraph. Suppose first that every K £ IC' has 
a connected component Y such that for every H G H-iY) we have 

p'^\H)<e\n\-'d-''-'/A. (3.1) 

Then it follows that the same is true with IC in place of IC'. Consequently, by removing all the edges 
of G' that are contained in rooted balls := {Bg'{v, k),v) of G' that satisfy ()3.ip and furthermore 
removing a set of edges of size at most e |y(G')|/2 which cuts G' to connected components of size at 
most k, we arrive at a graph in V. By the choice of the right hand side of ()3.ip . it follows that the 
number of edges removed from G' is at most €\V{G')\/2, which then contradicts the assumption 
that G' € B. Consequently, we find that there is some K £ IC' such that for every connected 
component Y K the inequality (j3.ip fails for some H £ 7i(Y). For each such Y let Hy be an 
H G H-iY) for which (13. ip fails, and let Q := {Hy : y is a connected component of K}. 
Suppose now that 

Pk{G,G') <e\n\-U-''-^/8. 

^To do this, the algorithm does not have to actually search all possible graphs. It can just store a (5/4-net of the 
set of all possible /i-tuples of values of fi'^^(H), taken over all graphs Ga G A. This is a finite list. 
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Then, in particular, we find that 

Since G £ V and a disjoint union of finitely many copies of K is not in V, this implies an upper 
bound on depending on K. (In fact, depending on how many disjoint copies of K are 

sufficient to make a graph that is not in V.) However, we can maximize over the finitely many 
choices of X S /C', to obtain an upper bound that depends only on V and e. Thus, except for a 
finite set of graphs G £ V, we have a lower bound on Pk{G\ G) or a lower bound on pr{G' ,G). As 
Pr is monotone non-decreasing in r, this proves the lemma. □ 

Proof of Theorem 11.21 Fix some e > 0, and let B be the set of graphs that are e-far from V. Let 
Z be the finite collection obtained from Lemma |3. II with B and V as above. Then by Lemma |3. II 
and Proposition 12.11 B and V\Z are distinguishable. Since Z is finite, the tester can easily test if 
the tested graph belongs to Z. Thus, we get a tester for V, which first tests for membership in Z, 
and then uses the tester which can tell apart B from V \ Z. This completes the proof. □ 

Recall the Lipton-Tarjan planar separator theorem ^29j, which says that every planar graph G 
has a set of vertices Vq of size 0(|y(G)|^/2), such that every connected component of G \ Vq has at 
most 2|y(G)|/3 vertices. Lipton and Tarjan used this theorem to show that the set of planar graphs 
with degree bounded by d is hyper- finite [30, Theorem 3]. Alon, Seymour and Thomas \T\ proved 
that the planar separator theorem as stated above holds more generally in the class of if-minor 
free, where H can be any finite graplj^. The proof of [30^ Theorem 3] therefore gives the following 
result. 

Proposition 3.2. Fix d £ N-(-. Let H be a finite graph, and let A be the set of graphs that are 
H -minor free and have degrees bounded by d. Then A is hyper-finite. □ 

Proof of Theorem 11.11 Since being i/-minor free is a minor-closed property, it is enough to 
prove the second claim of the theorem: that minor closed graph properties are testable. Let V 
be a minor closed graph property. If V includes all graphs, then it is clearly testable. Otherwise, 
suppose that H ^ V. Then V is ff-minor free. By Proposition 13.21 V is hyperfinite. Since V is 
monotone, the theorem now follows from Theorem 11.21 □ 

4 Proof of Theorem [22] 

Let G £ A. Since G is (e, A;)-hyper-finite, there is a set 5 C E{G) such that \S\ < €\V{G)\ and 
all the connected components of G \ 5 are of size at most k. Fix such a set S. With no loss of 
generality, we assume that none of the edges of S connects two vertices from the same connected 
component of G \ 5. The proof strategy is to replace the set S with a random set S that has 
no long-range dependencies and the probability of an edge being in S depends only on the local 
structure of G near the endpoints of the edge. It will then be easy to see that a similar S exists for 
Go if Pr{G, Go) is small and R is large. 

^The implicit constant in the O(-) notation above is explicit in |29j . Also [2] gives an estimate for this constant, 
which naturally depends on H. 
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Let us first introduce some notation. If IT is a subgraph of a graph H and K' is a subgraph 
of a graph H', then we say that the pair {K,H) is isomorphic to the pair {K',H') if there is an 
isomorphism of H onto H' that takes K onto K' . Isomorphisms of triples {H,K,J), K,J C H, 
are similarly defined. For a graph G, we define /C(G) as the set of vertex sets K C V{G) of size 
at most k which span a connected graph in G. Given the set S that disconnects G into connected 
components of size at most k, we let ICs{G) consist of those elements of fC{G) that span a connected 
component in G \ S*. We stress that the elements in 1C{G) are only required to span a connected 
subgraph in G, while the elements of ICs{G) are required to span a maximally connected subgraph 
in G \ S". In what follows it will be convenient to sometimes identify an element K G K{G) with 
the (connected) graph spanned by K. 

Given a subgraph H C G and r G N, let Nr{H) denote the subgraph of G induced by the 
vertices at distance at most r from H. Let N be some connected graph and let be a connected 
subgraph of N of size \V{K)\ < k. For r E N+, let Tr{K,N) denote the set of K' G /C(G) such 
that {K',Nr{K')) is isomorphic to (K^N). Set 

PriK,N):={ ^i^r{K,lM)^iO, 
I 1 otherwise . 

li K £ JC{G), we abbreviate, 

Pr{K):=pr{K,Nr{K)). (4.1) 
For V £ V{G), let /C(t') denote the elements of /C(G) that contain v. Set 

g^(7;) := J2 Pr{K) . 
K(^K.(v) 

Intuitively, Pr{K) is an approximation of the conditional probability that K £ /Cs(G) given the 
isomorphism type of the pair [K, Nr{K)^ . Since there is exactly one K G JC{v) n }Cs{G), we would 
expect that qr{v) is close to 1 for most vertices when r is large. In the following, we will need the 
fact that for some not too large R, very few vertices have qR{v) small. To this end, we have the 
following lemma. 

Lemma 4.1. There exists some finite Ri = Ri{d,e,k) and some R £ N+ H [1,-Ri] such that the 
number of vertices v G V{G) with qR{v) < 1/2 is at most e \V {G)\ / {2 d) . 

We stress that R itself may depend on G and S, but it is bounded by which is not allowed 
to depend on G and S. We postpone the proof of the lemma and continue with the proof of 
Theorem 12.21 Let IC' denote a random subset of /C(G), where each K G /C(G) is in K,' with 
probability min(2 log{2 d/ e) pji{K) , l), independently, where R is as provided by Lemma l4.ll Let 

S':= U dK, 

KeK' 

where dK denotes the set of edges of G that are not in K but neighbor with some vertex in K. 
Let W denote the set of vertices that are contained in some K £ fC', and S" denote the set of all 
edges of G having both vertices in V{G) \ W. Finally, define S := S' U S" . Clearly, the connected 
components of G \ 6* are of size at most k. We now have to estimate E\S\. 
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Let us start by estimating ElS"']. Consider an arbitrary vertex v G V{G) such that qR{v) > 1/2. 
Then 

P[v ^W]= Yl {1-F[K G /C']) 

<exp(- 2\og{2d/e)pR{K)) = {e/{2d)f''''^''\ 

KeK.{v) 

Therefore, if qB.{v) > 1/2, we have P[u ^ < e/{2d). On the other hand, the number of vertices 
satisfying qR{v) < 1/2 is at most e \V{G)\/(2d), by Lemma HTl Thus, we have 

BWiG) \ W\ < + J:{p[. ^W]:ve ViO, qniv) > < . 

Since every edge in S" is incident to a vertex in V{G) \ W, we get E|S"'| < e 

We now estimate E\S'\. For K G IC{G) set L{K) := rK(i^, iVK(K)) . Observe that K' G L(/C) 
if and only if L{K') = L{K). Let Ki,K2, ■ ■ ■ ,Km be a set of elements in /C(G), one from each 
equivalence class of the relation L{K') = L{K). Set Li := L{Ki). Note that pr{K) = p^^Ki) for 
every K £ Li, and since R> also \dK\ = \dKi\ for every K G Li. Therefore, the definition of pR 
gives 

By summing over i, we get, 

PR(K)|aK|=j; j;p^(i^)|5i^|=E E |9i^|= |5i^| = 2|5|. 

K£k:(g) i KeLi i LiniCs{G) KeiCs(G) 

As each element of /C(G) is chosen with probability min(2 log(2 d/e) pii{K), l) to be in /C', we infer 
that 

E|5'| < 4 log(2d/e) |5| < 4e log{2d/e) \V{G)\ . 
Putting this together with our previous bound on E|S"'|, we obtain 

E|5| < 4e log(3(i/e)|y(G)| . (4.2) 

Now suppose that Go is any finite graph with degrees bounded by d. We can define a random 
set of edges Sq C E{Gq), as follows. Let /Cq be a random set of elements of /C(Go), where each 
K G /C(Go) is placed in /Cq with probability min(2 \og{2d/e)pR{K,NR{K)),l^, independently. 
(Here, Nr{K) refers to the neighborhood in Go, of course.) Define S'q := Ui^eAC,', ^'^'^ define Sq 
based on Go and Sq as S" was defined based on G and S' . 

In order to estimate Ej^ol, we briefly consider the situation in G again. Note that for every 
vertex v G V{G) the expected number of edges that are incident with v and belong to S is completely 
determined by its neighborhood of radius r = R -\- k + 1, or, more precisely, by the isomorphism 
type of the rooted graph H{v) := {B{v, r),v). Let tn be this expectation when H is isomorphic to 
H{v). Recall that fi^{H) ■ \ V{G)\ is the number of vertices v G V{G) such that H{v) is isomorphic 
to H. Then we can write 

^E\Sl_^ G.rr. 
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Similar considerations apply to Go and Sq, and hence 



V ^ ^ 



H H H 

2E|5| 



\y{G)\ ^ 

where we have used the fact that tff < d as Gq is of bounded degree d. Thus, Gq ^ B if 

Pr{G,Go) <{8/d)elog{4/3) . (4.3) 

This completes the proof. □ 

Proof of Lemma 14.11 Let o be a uniformly random vertex chosen from V{G). Let ^ be a set 
of finite rooted graphs, which has exactly one representative for each isomorphism class of rooted 
graphs. For r G N, let Hr denote the random element from Q that is isomorphic to the rooted 
graph {^B(o,r), oj . Let Z denote the set of pairs {H,K) such that H £ Q and i^T is a connected 
subset of V{H) of cardinality at most k which contains the root of H. It is important to note that 
unlike Q, which does not contain two isomorphic copies of the same graph, the set Z may contain 
two isomorphic pairs {H,K) and {H,K'). 

For Z = (H, K) G Z, we would like to define Pr{Z) as the conditional probability given Hj. that 
H = Hk and the isomorphic image of K in B{o, k) is in lCs{G). But here is a slightly annoying 
technical point. If has nontrivial automorphisms, then there is no unique isomorphic image of 
K in B(o,k). For this reason, given o and r, we take a random isomorphism 7.^, chosen uniformly 
among all rooted-graph isomorphism from Hr onto [B{o,r),o^, with independent choices for each 
r. Let denote the sequence of maps (jj-li ° Jj '■ j = 0,1, . . . ,r — 1). Intuitively, /3r tells us for 
each r' < r how Hr' sits inside H^. 

For Z = {H,K) e Z let Az denote the event that Hk = H and 7fc(i^) € ICs{G), and set for 
every Z £ Z 

Pr{Z):=P[Az\Hr,f3r]. (4.4) 

Clearly, exactly one of the events Az, Z £ Z, holds. Therefore, 

J2Pr{Z) = l (4.5) 
z^z 

holds for every r E N. S inc6 (-f^7-,/?7-) Ccin be d-ctGrniiriGci from (^Hf-^i^ P'p-\-i^') it is clccir tlicit 

Po{Z), Pi{Z), p2{Z),... 

is a martingale for any fixed Z £ Z. (This is always the case when conditioning on finer and 
finer cr-fields.) Since Pr{Z) G [0,1] and B[priZ)] = P[Az] = Po{Z), we have B[pr{Zf] < Po{Z). 
Therefore, the variance of Pr{Z) is at most pq{Z). 
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Now recall that for a martingale Mi , M2 , . . ■ we have the so called "orthogonality for martingale 
differences", which says that 



n-l 



J^E[(M,-+i - M,f] = E[m2] -B[Mf] < Var(M„). 



(4.6) 



We apply this to the sequence pki{Z), i = 0,1,2, ... , (which is also a martingale), sum over Z £ Z 
and use our above bound on the variance ol pr{Z), to obtain for every j £ N-|- 

E ^[{P(^+l)k{Z)-p^k{Z))'] < E Var(p,,(Z)) < ^ po{Z) = 1 . (4.7) 

It is now time to make the connection between pr and pr- Fix some i £ N+ and Z = {H, K) G Z. 
Suppose that Hk = H. Then 

l{i+i)k{H(i+^)k) = B[o, {i + l)k) D Nik{-fk{K)) D lik{Hik) = B{o,ik) , 

where, as before Nr{U) denotes the r-neighborhood of U . Let Y denote the isomorphism type of 
the triple [Nki['yk{K)) ,^k{K) , o) . Then clearly Y can be determined from Z, //(j+ij^ and 
Moreover, Y determines the isomorphism type of {Hik^'^ik ° lk{K),Oik^ and therefore determines 
Pik{Z). Consequently, the three term sequence 



Pik{Z), l{Hk=H}^[^Z I Y], p(i+i)fc(Z) 

is a martingale. We abbreviate Pik{Z) := P [Az \Y~\. It is easy to see that 



(4i 



Pik{Z) 



Therefore, 



Qik 



UkilkiK)) H = Hk 
[0 H^Hk 

(o) = ^PikiZ). 

zez 



(4.^ 



Since the three random variables pik{Z), pik{Z), P[i+i)k{Z) form a martingale, we get from (14.6 
that 



E 



{v{i+i)k{z) - Pik[z)y 



= E 
> E 



{P(i+i)k{Z) -pik{Z))' 



{pik{z) -Pik{z)y 



+ E 



[pik[z) -pik{z)y 



(4.10) 



Recall the definition of IC{v) from just below (j4.ip . There is clearly a finite upper bounco t 
t{k,d) < (f^ on |/C(w)|. Observe that 

\{{H,K)£Z:Hk = H]\ = \lC{o)\ <t. 



^"One way of seeing this is to observe that every tree of size k can be realized as a closed walk of length 2k; just 
double each edge and take an Euler tour. Therefore, the number of sub-trees of size at most k that contain the root 
of a graph, whose maximum degree is d, is at most (f'. 
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Thus, there are at most t different Z G Z for which Pik{Z) —pik{Z) / 0. Cauchy-Schwarz therefore 
gives 

tY,{pik{Z) - Vik{Z)f > {^Y,Vik{Z)-Y,Pik{Z)f . (4.11) 

zez z^z z<^z 

Taking expectation on both sides and using hnearity of expectation we get 

t J^E[(p,fc(Z)-pife(Z))'] >E[(^p,fe(Z)- ^Pife(Z))']. (4.12) 

ZeZ Zt^Z z&z 

We now sum (|4.10p over ah Z G Z and apply (|4.12p and then (fi3]) and (ji?9|) . to get 

i ^E[(p(i+i)fc(Z)-p,fc(Z))'] >E[(^i^,fe(Z)- ^Pife(Z))'] =E[((?,fc(o)-l)']. 

Z<^Z ZeZ z<^z 

Now sum over i and apply (j4.7p . to obtain 

i-i i-i 

t>^E[(g,,(o)-i)'] = iy(G)ri E(^^'^w-i)'- 

i=l t)ey(G) i=l 

To complete the proof we can take 

j = A; + 8dtA;/e < lOfcd^'^+i/e , (4.13) 
and the same choice works for R\ in the statement of the lemma. □ 

5 Concluding Remarks and Open Problems 

1. Let us give an explicit upper bound for the dependence on e of the number of queries required 
to test the property of being iif-minor free for any fixed connected graph H. We assume here 
that the maximum degree d is a constant and H is fixed. Given a graph G and e > 0, let us 
choose eo and k as in the proof of Lemma ISTTl It is clear that 1/eo = poly(l/e) and the result 
of [2] together with |301 Theorem 3] give that k = poly(l/e). By Theorem 12.21 and its proof 
we know that if G is not (e/2, /c)-hyper-finite then for every G' that is i7-minor-free we have 
Pr{G, G') = VL{e) for some fixed R = l'^'^^^ je = 2?°^^^^/''). Here we are using the bounds that 
appear in (j4.3|l and (I4.13p . The algorithm first tries to distinguish between the case that the 
input is (eo, /c)-hyper-finite from the case that it is not (e/2, A;)-hyper-finite. By the proof of 
Proposition 12.11 this can be done with poly(/i/e) queries, where h is the number of graphs of 

90(H) r,2P°'>'(l/'=) 

bounded degree d and radius R. Clearly h = 2 =2 .If the algorithm finds that 

G is not (eo, A;)-hyper-finite it rejects G. Otherwise, the algorithm samples poly(l/e) vertices, 
and for each vertex it explores its neighborhood of radius k. This requires 2^^^^^ = 2^°^^^^^^^ 
queries. If any of the neighborhoods explored is not i7-minor free the algorithm rejects, 
otherwise it accepts. The algorithm clearly accepts G with high probability if it is //-minor 
free, as such a G is (eo, A;)-hyper-finite. If G is e-far from being //-minor free and not (e/2, A;)- 
hyper-finite then it is rejected with high probability in the first step. If not then it is easy 
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to see that (7 (en) of its vertices belong to connected subgraphs of G of radius at most k that 
are not ff-minor free. Hence, G is rejected in the second step. 

If H is not connected, then after testing hyperfiniteness, we can test for being H'-Toiaoi free 
for each connected component H' of H. It is then easy to determine if G is ff-minor free or 
not. Similar reasoning applies in the case of a property determined by a finite list of forbidden 
minors. By the Graph- Minor Theorem [34], this includes all minor closed properties. Thus, 

2Poly(l/e) 

every minor closed property can be tested using 2 queries. 

2. Another model of property testing is when the number of edges is arbitrary, and the error is 
relative to the number of edges [32]; that is, a graph with m edges is e-far from V if we have 
to modify em of its edges to get a graph satisfying V. This raises the following: 

Problem 5.1. What is the query complexity of testing minor-closed properties in graphs with 
hounded average degree? 

It is clear that o(n^/^) queries are not enough, for they will not suffice to detect a clique on 
of the vertices of the graph. 

3. As we have argued in Subsection II. 4^ minor closed properties are trivially testable in the 
dense graph model using 0(l/e) queries even with one-sided error. The intuition is that if a 
hereditary graph property is testable, then the reason must be that one can find a "proof" 
that a graph does not satisfy the property, in the form of a subgraph that does not satisfy 
it (which implies that the graph does not satisfy it). This intuition turns out to be correct 
in dense graphs [3]. However, it is easy to see that in the bounded degree model, we cannot 
expect to have such proofs. For example, if G is a bounded degree expander of girth r2(logn), 
then for any fixed H with at least one cycle, we have that G is f](l)-far from being iJ-minor 
fref0 when it is sufficiently large, but on the other hand, every subgraph of G of size o(logn) 
is a tree, and in particular //-minor free. Thus if H is not a tree, then one cannot test 
//-minor freeness with o(logn) queries and one-sided error. In fact, a much stronger Q.{y/n) 
lower bound can be deduced by adapting an argument from [22j. We raise the following 
conjecture, stating that the Q.{y/n) lower bound is tight. 

Conjecture 5.2. For every H , being H -minor free can he tested in the hounded degree setting 
with one-sided error and query complexity 0{y/n). 

If the conjecture is true, then the Graph Minor Theorem [3l] implies that the same is true 
for any minor-closed graph property. 

4. As we have briefly mentioned earlier, it does not seem like our approach can lead to testing 
algorithms for the property of being //-minor free whose query complexity is polynomial in 
1/e. It seems interesting to further investigate the possibility of coming up with such a tester. 

5. Our main result here is that we can distinguish in constant time between graphs satisfying 
a minor closed property, from those that are far from satisfying it. Can we also estimate 
in constant time the fraction of edges that need to be removed in order to make the graph 
satisfy the property? 

^This follows, for example, from the separator theorem of [2] for //-minor free graphs, mentioned earlier. 
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6. It would be interesting to obtain an explicit description of the difference between the frequen- 
cies of local neighborhoods that one sees in (e, /c)-hyper-finite graphs versus graphs that are 
not (e', A;)-hyper-finite, where e <C e'. 

Acknowledgements: We would like to thank David Wilson and Gabor Elek for helpful discus- 
sions. 
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